6 . 863 Final Project Writeup
نویسنده
چکیده
The goal of this system is to provide a measure of semantic similarity between any two verbs. The VerbOcean dataset contains various semantic relationships for a set of 3477 unique verbs. The pairwise relationships between verbs contained in VerbOcean were used to create a graph of semantic relations between many verbs. A tree traversal algorithm was implemented so that, given a starting word and a target word, various semantic paths from the starting word to the target word could be found. The VerbOcean dataset was generated by searching for the number of web pages which contained pairs of verbs in various simple syntaxes. The relationships incorporated into VerbOcean are similarity, strength, antonymy, enablement, and precedence. Similarity between two verbs was found by searching for instances of ‘to X and Y’ or ‘Xed and Yed’, while precedence was found by searching for ‘to X and then Y’ or ‘Xed and then Yed’. The number of results found (using Google) corresponded to the strength of a particular relationship between two verbs. The score of the shortest semantic path between two verbs intuitively gives a measure of how closely related those two verbs should be.
منابع مشابه
CS380L Project Writeup: Distributed Completion Service
Task parallelism is difficult to implement in a distributed setting due to machine unreliability and communication latency. HTCondor, an existing distributed computation framework, is insufficient for addressing these shortcomings. In this report, we present a high level abstraction built on top of HTCondor called the Distributed Completion Service (DCS). The DCS uses multiple different methods...
متن کاملProcess and Pitfalls in Writing Information Visualization Research Papers
The goal of this paper is to help authors recognize and avoid a set of pitfalls that recur in many rejected information visualization papers, using a chronological model of the research process. Selecting a target paper type in the initial stage can avert an inappropriate choice of validation methods. Pitfalls involving the design of a visual encoding may occur during the middle stages of a pro...
متن کامل